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Abstract 

Given a finite word u, we define its palindromic length |tt| pa i to be the least number n 
such that u = v\Vi . . . v n with each Vi a palindrome. We address the following open question: 
Does there exist an infinite non ultimately periodic word w and a positive integer P such 
that |«| pa i < P for each factor u of wl In this note we give a partial answer to this question. 
Let k be a positive integer. We prove that if an infinite word w is fc-power free, then for each 
positive integer P there exists a factor u of w whose palindromic length \u\ pa \ > P. We also 
extend this result to a wider class of words satisfying the so-called (k, /)-condition, which 
includes for example the Sierpinski word. 

1 Introduction 

Let A be a finite non-empty set, and let A + denote the set of all finite non-empty words in A. 
A word u = U\U2 ■ ■ ■ u n G A + is called a palindrome if u\ = u n _j+i for each i = 1, . . . , n — 1. In 
particular each a G A is a palindrome. We also regard the empty word as a palindrome. 

Palindrome factors of finite or infinite words have been studied from different points of view. 
In particular, Droubay, Justin and Pirillo [3] proved that a word of length n can contain at 
most n + 1 distinct palindromes, which gave rise to the theory of rich words. The number of 
palindromes of a given length occurring in an infinite word is called its palindrome complexity 
and is bounded by a function of its usual subword complexity [T]. However, in this paper we 
study palindromes in an infinite word from the point of view of decompositions. 

For each word u G A + we define its palindromic length, denoted by |w| pa i, to be the least number 
P such that u = V\V2 - ■ ■ Vp with each a palindrome. As each letter is a palindrome, we 
have |tt| P ai < \u\, where \u\ denotes the length of u. For example, |01001010010| pa i = 1 while 
|010011| pa i = 3. Note that 010011 may be expressed as a product of 3 palindromes in two 
different ways: (0)(1001)(1) and (010)(0)(11). In [9j, O. Ravsky obtains an intriguing formula 
for the supremum of the palindromic lengths of all binary words of length n. 

The question considered in this paper is 

Question 1 Does there exist an infinite non ultimately periodic word w and a positive integer 
P such that | it | pal < P for each factor uofw? 
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We conjecture that such aperiodic words do not exist, but at the moment we can prove it only 
partially. Namely, in this paper we prove that if such a word exists, then it is not fc-power-free 
for any k and moreover, for all k > 1, I > it does not satisfy the (k, /)-condition defined in 
Section [4j A discussion what exactly the condition means and which class of words should be 
studied now to give a complete answer to the question is given in Section [5] 

A preliminary version of this paper has been reported at Journees Montoises 2012. 



2 The case of /c-power-free words 

Let k be a positive integer. A word v 6 A + is called a k-power if v = u k for some word u 6 A + . 
An infinite word w = w\W2 . . . G A N is said to be k-power-free if no factor u of w is a /c-power. 
For instance, the Thue-Morse word 0110100110010110 . . . fixed by the morphism ^ 01, 1 H> 10 
is 3-power free [6]. 

Theorem 1 Let k be a positive integer and w = w\W2 ■ ■ ■ £ ^4 N . // w is k-power free, then for 
each positive integer P there exist a prefix u of w with |-u| pa i > P. 

Recall that a word u± ■ ■ ■ u n is called t-periodic if Ui = Ui+t for all 1 < i < n — t. 
The proof of Theorem [T] will make use of the following lemmas. 

Lemma 2 Let u be a palindrome. Then for every palindromic proper prefix v of u, we have that 
u is {\u\ — \v\) -periodic. 

Proof. If u and v are palindromes with v a proper prefix of u, then v is also a suffix of u and 
hence u is (|n| — \v |)-periodic. □ 

In what follows, the notation u>[i..j] can mean the factor WiWi+i ■ ■ ■ Wj of a word w = w\ ■ ■ ■ w n ■ ■ ■ 
as well as its precise occurrence starting with the position numbered i; we always specify it when 
necessary. 

Lemma 3 Suppose the infinite word w is k-power-free. Ifw[ii..i2] andw[i\..i^\ are palindromes 
with i% > i2, then 

\w[h..i3}\ > 1 1 n 
IMu--&2]| k — 1 

Proof. By Lemma[2j the word u^ii..^] is (i% — ^-periodic; at the same time, it cannot contain 
a &-power, so, |io[ii..i3]| < k(iz — 12). Thus, 

|io[ii.i 3 ]| |to[ii..i 3 ]| ^ mii..i 3 ]| _ ^ | 1 ^ 



w 



'ii..i 2 ]\ |«;[ii..t3]|-(t3-i2) (1- I) (k[*i"*s]|) " k ~ 1 ' 



Lemma 4 Let N be a positive integer. Then for each i > 0, the number of palindromes of the 
form w[i..j] of length less or equal to N is at most 2 + log k ^ k _^ N. 

Proof. For each i > 0, the length of the shortest non-empty palindrome beginning in position 
i is equal to 1. By the previous lemma, the next palindrome beginning in position i is of length 
more than and the one after that is of length more than (^j) 2 , and so on. The longest one 
is of length at most ./V but more than so that P < log fc /^_^ N, and the total number 

n + 1 of such words is at most 1 + log k ^ k _^ N. Adding the empty word which is a palindrome 
gives the desired result. □ 
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Proof of Theorem [TJ Fix a positive integer P and let N be a positive integer satisfying 

(2 + logfc/^D iV) p < AT. 

By the previous lemma, the number of prefixes of w of the form v\V2 ■ ■ - vp, where each v\ is 
a palindrome, of length less or equal to N is at most (2 + log k ^ k _^ N) p ', and hence at most 
iV. But u; has A^-many non-empty prefixes of length less or equal to N. This means that there 
exists a prefix u of w of length less or equal to N such that |u| pa i > P. □ 

3 Privileged words and other regularities 

In fact, the proof above does not use directly any properties of palindromes except for Lemma|3j 
So, analogous statements on the properties of /c-power-free words can be proved for any other 
type of word regularities for which lemmas analogous to Lemma [3] hold. In particular, we can 
almost immediately extend Theorem [T] to privileged words. 

Privileged words have been introduced by J. Kellendonk, D. Lenz and J. Savinien [5]; they are 
studied also in [8]. Privileged words are defined recursively as follows: first, the empty word 
and each element a E A is privileged. Next, a word u € A + with |u| > 2 is privileged if 
and only if it is a complete first return to a shorter non-empty privileged word, i.e., if there 
exists a non-empty privileged word v which is both a proper prefix and a proper suffix of u and 
which occurs in u exactly twice. For example, 00 is privileged as it is a complete first return 
to the privileged word 0. Similarly, 00101100 is privileged as it is a complete first return to the 
privileged word 00. This latter example shows that a privileged word need not be a palindrome. 
Conversely, the palindromes 1231321 and 00101100110100 are not privileged as neither word is 
a complete first return. However, if a word w is rich meaning that it contains + 1 factors 
which are palindromes, then each non-empty factor of w is a palindrome if and only if it is 
privileged (see Proposition 2.3 in [5]). Thus analogously we define the privileged length of a 
word u G A + , denoted |it| pr i V to be the least number n such that u = viv% ■ ■ -v n with each Vi 
a privileged word. Again we have the inequality |u| pr iv < \u\- For instance, |00101100| pr i v = 1, 
while |00101100110100| priv = 3. We note that 00101100110100 may be written as a product of 
3 privileged words in more than one way: (0)(010110011010)(0) or (00)(1011001101)(00). 

The following lemma is analogous to Lemma [2} 

Lemma 5 Let u be a privileged word. Then for every privileged proper prefix v of u, we have 
that u is (|ii| — | v |) -periodic. 

Proof. Suppose u and v are privileged words with v a proper prefix of u. We will prove that 
v is also a suffix of u. We proceed by induction on \u\. The result is vacuously true for \u\ = 1. 
Next suppose \u\ > 1. Then a is a complete first return to a privileged word u' with \u'\ < \u\. 
We claim that \v\ < \u'\. In fact, suppose to the contrary that \v\ > \u'\. Then u' would be a 
proper prefix of v and hence by induction hypothesis u' is also a suffix of v. This means that u' 
occurs at least three times within u (as a prefix of v, as a suffix of v and as a suffix of u). This 
contradicts that u is a complete first return to v! . Having established that \v\ < \u'\, it follows 
that v is a suffix of u' . In fact, if \v\ = \u'\, then v = v! while if \v\ < \u'\, then by induction 
hypothesis v is a suffix of v! . As v! is a suffix of u we obtain that v is a suffix of u as required. 
Whence, u is (\u\ — |-u|)-periodic. □ 

Now using Lemma [5] we can prove the following lemma. 

Lemma 6 Suppose the infinite word w is k-power-free. If w[i\..i2\ and w\i\..i^\ are privileged 
words with i% > i<i, then 

> 1 i 

mii..i 2 ]| ~ k—1' 
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Proof. By Lemma[5j the word io[ii.-*3] is (is — Z2)-periodic; the rest of the proof is completely 
analogous to that of Lemma [3j □ 

Now, using this lemma, we analogously to Theorem [T] prove the following 

Theorem 7 Let k be a positive integer and w = w\W2 ■ ■ ■ € ^4 N - If w is k -power free, then for 
each positive integer P there exist a prefix u of w with \u\ pT i v > P. 

Instead of privileged words, we could use words of any other type for which a statement analogous 
to Lemmas [3] and [6] would hold. However, the proof of the next more general statement uses 
substantially the properties of palindromes and not any other family of words. 



4 The case of the (k, /)-condition 

Recall that a fractional power w p ^ q of a word w whose length \w\ is divisible by q is defined 
as the word w^'^w', where w' is the prefix of w of length To state the next, more 

general case for which we can prove the unboundedness of the palindromic length, let us fix some 
integer k > and define a k-run in a word w as follows: a £;-run is an occurrence w [i-j] such 
that the word w is a fc'-power for some (possibly fractional) k! > k, but neither w[i — 
nor w[i..j + 1] are &:"-powers for any k" > k. 

Several problems on the maximal number and the sum of exponents of runs in a finite word 
have been studied, e. g., in [21 13]. 

Example 1 The word v = 1(100) 5 101(001) 7 11 has two 5-runs, namely, u[2..18] = (100) 17 / 3 and 
u[18..4Q] = (010) 23 / 3 . They are also 2-runs, 3-runs and 4-runs; the latter is also a 6-run and a 
7-run. There are no 8-runs in v. 

Note that A;-runs are defined as occurrences of words, or, in fact, as pairs of positions corre- 
sponding to their beginnings and ends. So, we can say that a /c-run w[i..j] covers a position x 
of the word w if i + 1 < x < j — 1. Clearly, a position can be covered by an arbitrary number 
of /c-runs, and even by an infinite number of them. 

Example 2 The third letter in the word v = 00010010010100100101001001 is covered by three 
3-runs: v[1..3] = 3 , u[2..11] = (001) 10 / 3 and u[3..26] = (01001001) 3 . 

The position 1 in the infinite word defined as the limit of the sequence {si}^ =1 , where s\ = 0001, 
Sj+i = s 3 l, is covered by an infinite number of 3-runs. 

Let us denote the number of /c-runs covering position n in a word w by r w ^(n) or simply by 
rfc(n) if w is fixed in advance. The maximal value of rfe(n) for i < n < j is denoted by rk[i..j}- 

We say that an infinite word w satisfies the (k, l)-condition for some k > 2 and I > if it is not 
ultimately periodic and r w fe(n) < I for all n, that is, if each position n in w is covered by at 
most I many /c-runs. 

Example 3 The Sierpinski word w s = 0101110 1 01 9 0101110 1 01 27 • • • , defined as the fixed point 
starting with of the morphism ip : i— > 010, 1 h-> 111, satisfies the (3, l)-condition. Indeed, the 
only primitive factor u whose powers at least 3 occur in w s is 1, and thus there is at most one 
3-run covering each position in it. 
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Remark 1 We have not managed to find a proof of this statement in the literature, but it 
seems very believable that for any morphic word w there exists some k such that there exists 
only a finite number of primitive words whose powers greater than k occur in w. If it is true, it 
means almost immediately that all the morphic words satisfy a (k, l)-condition for some k, and 
thus that all aperiodic morphic words have unbounded palindromic length of factors. 

The following theorem is a generalization of Theorem [T] which corresponds to the particular case 
of I = 0. Note that it is stated for a factor of the word w, not for its prefix. 

Theorem 8 If an infinite word w satisfies the (k, I) -condition for some k > 2 and I > 0, then 
for each given P > it contains a factor u with \u\ pa \ > P. 

The remaining part of the section is devoted to the proof of this theorem. The scheme of the 
proof is the following. In the first part of the proof we are going to introduce a new measure of 
words, so that the ratio of measures of two palindromes starting from a point is at least 1 + rz-j 



for some constant C (Lemma 19). In the second part of the proof we choose a factor of big 
enough measure and deduce that the factorizations of prefixes of this factor into P palindromes 
cannot cover all the prefixes. Hence we derive the existence of a factor with palindromic length 
greater than P. Though the general idea of the proof is similar to the case of /c-power free words 
(with measure instead of length) , the proof is much more technical. 

First of all, note that if there exist arbitrarily long parts u>[i..j] of w with rk[i..j] = 0, then we 
can proceed as in the proof of Theorem [T] and find a prefix of palindromic length greater than P 
in any factor w[i..i + N] of w such that r&[i..i + N] = and (2 + log fc /( fc _ 1 ) N) p < N. So, in the 
main case of the theorem the length of factors of w not intersecting with any fc-runs is bounded, 
and in particular w contains an infinite number of fc-runs. So, from now on we assume that w 
satisfies this condition. 

Let us say that a fc-run w[i..j] is an upper /c-run in w if it is not covered by another fc-run, that 
is, if there is no fc-run w[i'..f] not equal to it such that %' < i < j < f. For each position n in 
w, we define by mfe(n) = m(n) the number of upper /c-runs of the form w[n..i] or w[i..n]. 

Example 4 Consider the word v = 11(1000) 3 0. In it, v[1..3] = 111, u[3..14] = (1000) 3 and 
u[12..15] = 0000 are upper 3-runs, whereas the 3-runs t>[4..6] = 000 and v[8..10] = 000 are 
not upper since they are covered by w[3..14] = (1000) 3 . Also, we have m(3) = 2, m(l) = 
m(12) = m(14) = m(15) = 1 and m(n) = for any other n; here m{n) is exactly the number of 
occurrences of the position n in the notation of the upper 3-runs, which are v[1..3], v[3..14] and 
u[12..15]. 

Lemma 9 For each n we have < m(n) < 2. 

Proof. At each position n of w there is at most one upper A;-run beginning in position n and 
at most one upper fc-run ending in position n, so, < m(n) < 2. □ 

Now let us define the measure m[i..j] of an occurrence w[i..j] as the sum 

3 

m[i..j] = m(n). 

n=i 

Clearly, it is indeed a measure, that is, the function m[i..j] is non-negative, equal to for the 
empty word w[i..i — 1] for any i, and the measure of a disjoint union is the sum of measures, 
which means in particular that 
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Lemma 10 For all i\ < 12 < 13 we have 

m[h..i3] = m[ii..i2] + mfa + 1.-23]. □ 

Note also that the function rk[i..j] is defined as the maximum of r^(n) for n 6 {i, . . . , j}, and is 
uniformly bounded by I due to the {k, Z)-condition, whereas m[i..j] is defined as the sum of m{n) 
and is not uniformly bounded since otherwise w would contain a finite number of fe-runs. Recall 
that we assume that the length of factors of w not intersecting with any /c-runs is uniformly 
bounded, since otherwise we simply apply the proof of Theorem [Tj This gives us 

Lemma 11 There exists a unique V , 1 < V < I, such that 

• there exists some M > such that m[i..j] > M implies rk[i-.j] > I' ; 

• for all L there exist i,j such that m[i..j] > L and rk[i..j] < □ 

In the remaining part of the proof we shall always consider (occurrences of) factors of w with 
fk\}--j] < due to the lemma above, their measure can be arbitrarily large. Due to the same 
lemma, each part of such a word whose measure is at least M must contain a position covered 
by exactly /' of k-ixms. 

Let us say that a k-rxva. w\j\..j?\ is an internal run within an occurrence w \pi--P2] if it intersects 
it but does not cover positions p\ or P2, that is, if p\ < j\ < 32 < P2- Similarly, a k-run 
w\j\..j2] is called a left /c-run in a word w\p\..p2\ if it covers the position p\ but not P2, that is, 
jl < Pi < 32 < P2\ symmetrically, it is a right A;-run if it covers the position P2 but not p\, that 
is, p\ < ji < P2 < J2- At last, it is a covering fc-run if it covers both ending positions, that is, if 

jl <Pl<P2< J2- 

Lemma 12 Each k-run intersecting with an occurrence w[p\..p2} is either internal, or left, or 
right, or covering. 

Proof. The four cases are determined by two facts: if the symbols w Pl and w P2 are parts of 
the fc-run. □ 

Lemma 13 If w\p\..p2\ = w\q\..q2[ and w\p\ + 1..P2 — j] for some i,j>0 is an internal k-run 
for w\p\..p2], then w[q± + i..q2 — j] is an internal k-run in w[q\..q2\- However, the sets of left, 
right and covering runs depend on an occurrence of a word, and thus can be completely different 
for w[p\..p2] and for w\q\..q2\. Moreover, if ' w\pi + i..p2— j] is an upper k-run, it does not imply 
that w[q± + i..q2 — j] is an upper k-run, and vice versa. 

Proof. If w\p\..p2\ = 10 [91.. 92] = u = u\ - ■ -u n , where n = P2 — pi + 1 = 92 — 9i + 1) then 
w\pi + i--P2 — j] = u i+i • • • u n -j = v. By the maximality in the definition of a fc-run, we see that 
the symbols Ui and u n -j+i break the periodicity of v, so the /c-run always starts at the symbol 
number i + 1 of u and ends at its symbol number n — j + 1. 

To give an example for the second part of the statement, consider the word 

aba.bb.aba.a k ~ 2 .aba.bb.(aba) k cw' , 

where the infinite word w' is on the alphabet {b, c}. We see that in the first occurrence of aba in 
it, there are no left, right or covering fc-runs; in the second one, there is a right k-run a k , which 
is also a left run for the third occurrence of aba; and the fourth occurrence of aba is a prefix of 
a fc-run {aba) k covering it. 
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At last, one occurrence of a word can be an upper internal /c-run whereas another occurrence is 
not an upper one. As an example, consider the word 



(ba k b) k bb ba k bcw', 

where w' does not contain the symbol a. We see that in the first k occurrences of ba k b the /c-runs 
a k are covered by the /c-run (ba k b) k b, and in the last one, the /c-run a k is an upper one. □ 

To state the next lemma, symmetric to the first part of the previous one, we denote by v the 
mirror image v n v n -i ■ ■ ■ v\ of a word v = v\ ■ ■ ■ v n . In particular, a palindrome is exactly a word 
v such that v = v. 

Lemma 14 If w\p\..p2[ = v and w\q\..q 2 \ = v, and if w\p\ + 1..P2 — j] for some i,j > is an 
internal k-run within w[p\..p2\, then w[qi + j--qi — i] is an internal k-run in w[q\..q<i\. 

Proof. It is sufficient to realize that if is an internal /c-run within v = v± ■ ■ -v n , then 

v[n — j + l..n — i + 1] is an internal /c-run withing v = v\ ■ ■ ■ v n = v n ■ ■ ■ v\. □ 

Lemma 15 The number of left (resp., right) k-runs for w[p\..p2\ is bounded by rrp\) (resp., 
rk{P2)) and thus by r k [pi..p 2 ]. 

Proof. The /c-runs covering the position p\ are left or covering runs for w\p\..p2\, and their 
total number is {p\ ) . Symmetrically, the A;- runs covering the position p 2 are right or covering 
runs for w\p\..p2\, and their total number is rk(j>2)- D 

Lemma 16 The set of covering runs for w\p\..p2\ can be non-empty only if m\pi..p2\ < 2r^[pi..p2]- 

Proof. Suppose that there is a fc-run covering w [pi.. p2\- Then every fc-run w[q\..q2[ contributing 
to the measure of w\p\..p2[ satisfy the following condition: one of the values q±, qi lies inside 
the interval [P1..P2]) the other one outside. In other words, either p\ < q\, p\ < q 2 < P2, or 
Pi < Qi < P2 , P2 < Q2 ■ The number of A;-runs satisfying the first condition is at most r& (p\ ) , the 
number of fc-runs satisfying the second condition is at most ^(j^), and each of them contributes 
at most 1 to the measure. Therefore, m\p\..p2\ < ^fc(pi) + ^fc(P2) < 2r/c[pi..p2]- D 

The following two "lemmas of inviolable parts" are crucial for the proof. The first one is stated 
for an occurrence of v and the second one for an occurrence of v, since it is what we need further 
in the proof, but in fact both of them can be stated both for v and for v. 

Lemma 17 If wlii.Az] = v, r&^'i..^] = I', and m[ii..i 2 ] > M + 41', where the parameters I' 



and M are defined in Lemma 11, then m\j\..j<j\ > 2 for all ji,j2 such that w\j\..j2[ = v and 
rk[h-h] < I'- 



Proof. Due to Lemma 16 there are no /c-runs covering w[i\..i2\- Consider the maximal 



qi G . . . ,i 2 — 1} such that wfaj.-gi] is a left fc-run for lyfzi..^]; this fc-run is a covering one 



for w[ii..qi], and thus due to Lemma 16, m[ii..q , i] < 2rk[ii-.qi] < 2rfe[ii..«2] = 2i'. If there are 



no left /c-runs in w[ii..i2], we put q\ = i\ — 1, and thus m[i\..qi\ = since it is an empty word. 



So, due to Lemma 10, we have m[q\ + l.-i 2 ] = m^i..^] — fn[ii--Q.i\ > M + 21'. 



Now symmetrically, due to Lemma 16, there are no /c-runs covering w[q\ + l..^]. Consider the 
minimal q2 E {i\ + 1, . . . ,^2} such that w[q2--y] is a right /c-run for w[i\..i2\- I n fact we have 
Q2 > Qi + 2 since otherwise w[q2~y] is a covering /c-run for w[q\ + l..^], a contradiction. So, the 
/c-run w[q 2 ..y] is a right /c-run for w[qi + l..i 2 ] and a covering one for w[q2--i2], and thus due to 
Lemma [l6j m[q2-i 2 ] < 2rj c [q 2 ..i2\ < 2r k [ii--i2] = 21' . If there are no right /c-runs in w[i\..i2], we 
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<=21 ' <=M+1 >=C <=M+1 <=21 ' 




inviolable part 



Figure 1: Proof of Lemma 18 



put q 2 = %2 + 1, and thus m[?2-*2] = since it is an empty word. So, due to Lemma 10, we have 
m[qi + l..q 2 - 1] = m[qi + l..i 2 ] - mfe--^] > M. 

Consider the occurrence w[q\ + 1..92 — 1]- By the construction, all /c-runs covering it are internal 
fc-runs for tofii..^]. At the same time, m[qi + l..q 2 — 1] > and thus by the definition of 



M in Lemma 11 there is a position i\ + n € {gi + 1, . . . , 92 — 1} with rp.(ii + n) = I'. But 
due to Lemma 14, all the I' internal /c-runs for w[i\..i 2 } = v covering the position i\ + n have 
analogues in w\j\..j 2 \ = v which cover the position j'2 — n 6 {ji, . . . ,j 2 }- Due to the condition 
fk[ji--32\ < I', at least one of these I' runs internal for w{j\..j 2 \ and covering the position j 2 — n 
is an upper run in w, and thus it contributes 2 to m\j\..j^. □. 

Lemma 18 Tfwfii..^] = r^fii..^] = Z', andm[i\..i2\ = 2M+4/'+2+C, where the parameters 



V and M are defined in Lemma 11, and C is some positive constant, then m\ji..j2] > C for all 



ji,j 2 such that w[j 1 ..j 2 \ = v and r k [j 1 ..j 2 ] < I' '■ 

Proof. As in the previous lemma, there are no fc-runs covering w[i\..i 2 ]] and after we cut from 
w[ii..i 2 ] the longest prefix w[i\..q\\ covered by some left fc-run and the longest suffix w[q2--i 2 \ 
covered by some right fc-run, we get a factor w\q\ + l..q 2 — 1] of measure m[qi + l-.q 2 — 1] > 
2M + 2 + C such that all /c-runs intersecting with it are internal fc-runs for w[i\..i 2 }. We illustrate 
the proof of Lemma 18 by Fig.[4j The "inviolable part" shown there is the word w[ii+ni..ii+n 2 ], 



whose measure m[i\ + n\..i\ + n 2 \ does not depend on an occurrence of v to w. 

Consider the minimal prefix w[q\ + L.pi] and the minimal suffix w\p 2 ..q 2 — 1] of w[q\ + l..q 2 — 1] 
such that m[qi + l..pi] > M and m[p 2 ..q 2 — 1] > M. Due to Lemma[9l we have m[q\ + l..pi] < 



M + 1 and m\p 2 ..q 2 — 1] < M + 1, and thus due to Lemma 10, m[pi + l..p 2 — 1] > C. At the 
same time, due to the definition of M, there are positions i\ + n\ in w[q\ + l..pi] and i\ + n 2 in 
w[p 2 ..q 2 — 1] such that rk(h + n\) = rk(i\ + n 2 ) = I'. All the I' runs contributing to rk(h + n\) 
(resp., rk(ii + n 2 )) are internal /c-runs for «;[ii..i2] and thus do not depend on the occurrence of 
v = w[i\..i 2 ]. So, in another occurrence v = w[j\..j 2 \ of v we have rk(ji +111) = rk(ji + n 2 ) = I', 
and all the /c-runs contributing to r k {ji +ni) (resp., rjt(ji + ^2)) are internal A;-runs for w[ji..j 2 ]. 
So, there can be only internal fc-runs for w[ji..j 2 ] which intersect w[j\ + n\..j\ + n 2 ] and thus 
affect its measure. So, m[ji..j 2 ] > m\ji+ni..ji+n 2 ] = m[i\ +n\..i\ +n 2 ] > m\p\ + l..p 2 — 1] > C. 

In Fig. |4j the "inviolable part" shown there is the word w[i\ + m..ii + n 2 ], whose measure 
m[i\ + n\..i\ + n 2 ] does not depend on an occurrence of v in w. 

□ 

The following fact is analogous to Lemma [3] 

Lemma 19 Suppose that an infinite word w satisfies the (k, I) -condition, and the constants I' 



and M are defined by Lemma 11, If w[ii..i 2 ] and w[ii..is] are palindromes, where is > i 2 , 
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Th[h-h] < I' and m[i 1 ..i 2 ] > (k - 1)(4/' + 2M + 3), then 

m[i\..h] 



> 1 + 



m[h..i 2 } ~ (k- l)(4l' + 2M + 3) ' 

PROOF. By Lemma[2j the word lo^'i.-ia] is (is — ^-periodic. Denote its suffix w[i 2 + l--^] by 
v and define k' so that w[ii..i 2 ] = v'v k ~ 2 (and thus u;[ii..i3] = v'v k -1 ) for some suffix v' of v 
not equal to v (here v' can be empty) and for some k' > 2. Then k' < k since otherwise 10^1.^3] 



would have been covered by some /c-run which is impossible due to Lemma 16 



Suppose that m[ii..i 2 ] = (k — 1)(4Z' + 2M + 2 + C) for some constant C. Here by the assertion 



we have C > 1; note that C can be non-integer. Due to Lemma 10, the measure of w;[ii..i2] is 
the sum of measures of its prefix v' and of k! — 2 occurrences of v. By the pigeon-hole principle, 
it means that some of these occurrences of v (or of v') have measure at least 41' + 2M + 2 + \C\ . 



It means by Lemma 18 that the measure m[i 2 + l..*3] of another occurrence w[i 2 + l--^] of v is 



at least C. So, due to Lemma 10 again, we have 



m[ii..i 3 ] _ 1 m[i 2 + 1..^] > 1 C 



m[*i..i 2 ] m[ii..i 2 ] ~ (A; - 1)(4/' + 2M + 2 + C) 

The right hand side of this inequality is a growing function of C for C > 1, so the minimal value 
of C = 1 gives its minimum, and we have 

m[il " i3] >1 + 1 • □ 

m[h..i 2 ] ~ (k- 1)(4/' + 2M + 3) 



Now we proceed to the second part of the proof, where we are going to use Lemma 19 to 
prove that there should be a factor with palindromic length greater than P. The sketch of the 
remaining part of the proof is the following. We assign to each factor w[i..j] its code C[i..j], 
which is a word on a binary alphabet. We code each factorization P of a prefix of 
into palindromes by a code C*([i..j],¥), which is obtained from C[z..j] by inserting symbols to 
positions between the palindromes. After that, taking a factor w[i..j] of big enough measure 
(the required measure depends on P and several parameters of the word), we obtain that the 
number of possible codes of factorizations of prefixes of w[i..j] into at most P palindromes is 
less than the length of the code C[i..j]. We deduce from that the existence of a prefix of w[i..j] 
not decomposable to P palindromes. 

Consider a factor w[i..j] with rk[i..j] < V of big enough measure ("long" factor). The required 
measure iV is determined below by ([2]). Define the code C[i..j] as a word on the alphabet {1, 2} 
obtained from the word m(i)m(i + 1) • • • m(j) by erasing the symbols equal to 0. The length of 
the code C[i..j] is denoted by c[i..j]; clearly, 

m[i..j]/2 < c[i..j] < m[i..j], (1) 

since in fact m[i..j] is the sum of the symbols of C[i..j], and each of them is equal to 1 or 2. 

If i and j are fixed, consider the word m(i)m(i + 1) • • • m(j) £ {0, 1, 2}* and denote by rih the 
position giving the hth non-zero symbol in it, so that m(n^) G {1, 2} for all h = 1, . . . , c and 
m(n) = for all other n E {i, % + 1, . . . ,j}\{ni, n 2 , ■ ■ ■ , n c }. We also define no = i — 1. Due to 
this definition, we have C[i..j] = a\ ■ ■ ■ a c with ah £ {1,2} and m(nh) = ah for all h = 1, . . . , c. 

Example 5 Consider the prefix io[1..25] = 0101110101111111110101110 of the Sierpinski word 
w, considered as a word satisfying the (3, l)-condition. Its code is C[1..25] = 111111, and the 
positions rii,...,riQ are equal to 4,6,10,18,22,24: these are exactly first and last positions of 
3-runs in w, that is, of "long" powers of 1. We also fix no = 0. 
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Consider a factorization into palindromes of a prefix w[i..i'} of w[i..j}: w[i..i'] = w[io + l.Ai]w[i x + 
I.A2] • ■ ■ w[i p -i + l.Ap], where i = io + 1, i p = i' , and each word w[id + l.Ad+i] is a palindrome. 
Define the partition to palindromes P as the sequence P = {io,i±, . . . ,i p }, and the code of 
this partition in as the word C* (\i..j],i\, ...,i p ) = C*([i..j],F) on the alphabet {1,2,*} 

obtained from the code Cfi.j] by adding a star before the symbol for each d such that 
n h-i < id < n h- Note that the number of stars in the code of a partition to P palindromes is 
exactly P + 1. 

Example 6 Continuing the previous example, consider the following partition to palindromes 
of a prefix of length 22 of the Sierpinski word w, considered in its turn as a prefix of u)[1..25]: 

(010)(11)(10101111111110101)110. 

We see that io = 0, i\ = 3, 12 = 5, £3 = 22, P = {0, 3, 5, 22}, and thus the code of this partition 
is C([1..25],3,5,22) = C([1..25],P) = * * 1 * 1111 * 1. 

The stars correspond to the boundaries of palindromes, and the symbols 1 and 2 correspond to 
beginnings and ends of upper fc-runs in w[i..j]. Note that the code of a partition of a prefix of 
w[i..j] always begins with a star. 

We are going to prove that if c[i..j] is large enough, then there is a pair of consecutive symbols 
from {1, 2} in C[i..j] such that no partition P of a prefix of w[i..j] to at most P palindromes has 
a star between them in C*([i..j'],P). In particular it means that there is a prefix of w[i..j] with 
palindromic length greater than P, and thus we prove Theorem [8j 

Lemma 20 Suppose that for the word w[i..j] considered above we have c[i..j] = N and i' , 
i < i' < j is a position such that i' = — 1 for some h, 1 < h < N . Then the number of 
values of h' such that there exists i" with w[i' + l.A"] being a palindrome and n^'-i < i" < , 
is bounded by H = D x + 1 + log D2 (2iV/L>i), where D 1 = (k — 1)(4/' + 2M + 3) and D 2 = 1 + ^ . 

Proof. The fact that c[i..j] = N means in particular that m[i..j } < 2N due to 0. We shall 
estimate the number of possible values of m[i' + l.A"] not exceeding 2N, where w[i' + l.A"] is 
a palindrome, and this will give an upper bound for the number of values of h! , since rih are 
exactly the positions where the measure changes. 

First of all, m[i' + l.A"] can take at most (k - 1)(4/' + 2M + 3) + 1 = D x + 1 values of h' 
from to (k — 1)(4/' + 2M + 3). Due to Lemma [l9j the value of h' numbered D\ + 2 must be 
equal at least to D1D2, where D2 = 1 + (fc_i)(4;/ 1 + 2M+3) = 1 + tj^ ; and the value of h' number 
Di + n + 1 is equal at least to DiD^. Even for the maximal n we should have D\D^ < 2iV, so 
that n < \og£) 2 (2N/Di), and the total possible number of measures of palindromes is bounded 
by D 1 + l + l ogD2 (2iV/£>i). □ 

Lemma 21 Suppose that for the word w[i..j] considered above we have c[i..j] = N and i' , 
i < i' < j is a position such that n^i < %' < nh for some h, 1 < h < N . Then the number of 
values of h' , such that there exists i" with w[i' + l.A"] being a palindrome and n^/^i < i" < , 
is bounded by (M + 4/')(log D2 (2iV/Di)) + D 3 , where, as above, D x = (k - 1)(AV + 2M + 3), 
D 2 = 1 + j^, and D 3 = (M + Al') 2 + M + 41' + Dx + 1. 

Proof. As in the previous lemma, we shall estimate the number of possible values of m[i' + 1.A"] 
not exceeding 2N, where w[i' + l.A"] is a palindrome. As above, m[i' + l.A"] can take at most 
{k - 1)(4Z' + 2M + 3) + 1 = Di + 1 values from to (k - 1)(4/' + 2M + 3) = D x . 
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Suppose now that m[i' + > D\, and consider the prefix w[i' + l..n^ — 1] of w[i' + note 
that by the definition of the sequence {n^}, its measure m[i' + L.n^ — 1] = 0, and the measure 
of w[rih--i"] is equal to that of w[i' + l.i"]. 

Suppose first that — i' — 2 > i" — n^, that is, that w[i' + l-.n^ — 1] = t contains the center of 
the palindrome w[i' + = u. It means that u = tv = vt'v and t = vt' for some words v and 
tf. We see that the measure of the occurrence of v starting from i! + 1 is equal to 0, and thus, 



due to Lemma 17 we have m[i' + = m[nh--i"] < M + 4Z', a contradiction to the fact that 
m[i' + > Di. 

Now suppose that nh — i' — 2 < i" — n^, that is, w[i' + l..rt/j — 1] = t is shorter than a half of 
+ = u. It means that u = tvt for some palindrome v starting from the position n^. 
Let us denote the measure of v by m. The measure of the prefix occurrence of t is here equal to 



0; so, due to Lemma 17, the measure of the suffix occurrence of t is at most M + AV — 1. So, we 
have m < m[i' + < m + M + 41' — 1, so, for each possible value of m > -Di — (M + 4/'), the 
measure m[i' + takes at most M + 4Z' different values. Adding possible values from to 
Di , we see that the total number of values is bounded by Di + 1 + (M + 41') (H - D\ + M + 41') , 
where -ff is the bound for the number of measures of palindromes starting from obtained in 



Lemma 20, so that H = D\ + 1 + log D2 (2N / D{) . Simplifying the expression, we obtain that 
D x + 1 + (M + 4l')(H -D 1 +M + 4l')=(M + 41') log Da (2N/D l ) + (M + 4l')(M + 41' + l) + D 1 + 
l=(M + 4l')\og D2 (2N/D 1 ) + D 3 . ' □ 

Proof of Theorem [8} Fix a positive integer P and let N be a positive integer satisfying 

[(M + 4/ / )log D2 (2iV/ J D 1 ) + J D3] p <iV, (2) 



where the constants D\~D 3 are defined in Lemma |2lj Consider a factor u>[i..j] of w with 
rk[i--j] < I' and c[z..j] = N; such a factor always exists by the definitions of I' and of c[i..j]. 

By the previous lemma, the number of possible codes of decompositions of prefixes of to 
P palindromes w[i d + l..i d+1 ], d = 0, . . . , P - 1, is at most [(M + 4l')\og D2 (2N / D\) + D 3 ] p , and 
hence less than N. In particular it means that the position of the last star in the code can take 
less than N different values; but the length c[i..j] of the code of is N. So, there are two 

consecutive symbols in c[i..j], corresponding to the positions and such that the last 

star in the code of the decomposition to P palindromes can never appear between them, that 
is, that no word w[i..q], where < q < n^+i, can ever be decomposed into P palindromes. □ 



5 Discussion 



Even if we prove that the palindromic length of factors of any aperiodic word is unbounded, 
some ultimately periodic words, for example, w = (110100)^, contain factors having arbitrarily 
large palindromic lengths. So, unlike for example the result by Mignosi, Restivo and Salemi 
on repetitions and periodicity [7j, the conjectured property will not give a characterization of 
aperiodic words. 

We prove the following property of ultimately periodic words with a uniform bound on the 
palindromic length of its factors: 

Proposition 22 Let P be an integer, w an ultimately periodic word such that |u| pa i < P for 
each factor u of w. Then w has a tail w' of the form w' = v{p\p2) UJ , where p\ and p2 are 
palindromes. 

Proof. Let w' be a tail of w having period t. Consider a factor u of w' with \u\ > tP. Then u 
can be factored as u = v\Vi . . . v m with m < P and each Vi a non-empty palindrome. At least 
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one of the palindromes V{ in this factorization has length greater than t. So, t is a period of 
this long palindrome. Now the proposition follows from the well-known fact that a period of a 
palindrome has the form P1P2, where p\ and P2 are palindromes. □ 

Proposition [22] implies that if the answer to Question [T] is "no", then an infinite word w having 
a uniform bound on the palindromic length of its factors is ultimately periodic, and moreover 
its period has the form P1P2, where p\ and P2 are palindromes. 

As we have mentioned above in Remark [TJ the (k, ^-conditions for some k and I seems to be 
fulflled in particular for all morphic words, so, it is hardly probable that an example of an 
aperiodic word with bounded palindromic length of factors will be a morphic word. At the same 
time, there exist Sturmian words which do not satisfy any (k, £)-condition: these are exactly 
Sturmian words whose elements of the directive sequence are unbounded (see Chapter 2 of [6] 
for the definitions). So, this paper does not contain a proof of unbounded palindromic length of 
factors which would be valid for all Sturmian words. 

It is also clear that 



Lemma 23 If there exists an aperiodic word with bounded palindromic length of factors, then 
there exists a binary one. 



Proof. Consider an aperiodic word w on an alphabet A = {ai, . . . , a q } and for each i G 1, . . . , q 
define a coding q by Cj(aj) = 0, Cj(6) = 1 for all other symbols b G A. Consider the infinite words 
ci(w), . . ., c q (w). At least one of them, let us denote it by c(u>), is aperiodic (since otherwise w 
would be periodic with the period equal to the least common multiple of periods of ci(w), . . ., 
c q {w)). At the same time, for each factor u of w we have |ti| pa i > |c("u)lpai- So, if |u| pa i < P 
for all factors u of w and for some P, then the same is true for all factors c(u) of the binary 
aperiodic infinite word c(w): |c(u)| pa i < P. □ 



Remark 2 The proof of Lemma 23 above is valid only for the case of a finite alphabet A. 
Another proof, valid also for the infinite alphabet {0, 1, . . . , n, . . .}, was suggested by T. Hejda 
who uses the morphism c defined by c(i) = 10*1. It is not difficult to prove that |u| pa i < P for 
all factors u of w, then |v| pa i < P + 4 for all factors v of c(w). 



At last, we recall again that for the case of /c-power-free words, our proof can be extended 
in particular to privileged words instead of palindromes (see Section [3]). However, Theorem 
[8] cannot be directly extended to privileged words since in Lemma 21 we used the properties 



specific for palindromes which allowed to apply Lemma 17 
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